Stochastic Order Induced by a Measurable Preorder by
نویسنده
چکیده
Kamae, et. al. [8, Theorem 1] presents a general characterization of the partial ordering of probability measures induced by a closed partial ordering on the underlying Polish state space. A preorder is a reflexive and transitive, but not necessarily antisymetric relation. This paper presents a similar characterization of the preordering of probability measures induced by a measurable preordering on the underlying Polish space. We then apply this result to obtain a characterization of stochastic majorization, the preorder induced by the widely applied majorization preorder on Euclidean space. We show that a multifunction associated with this preorder is compact and convex valued and continuous, and hence satisfies the hypotheses of our characterization. The continuity properties of the majorization ordering and the induced stochastic majorization ordering have not been widely recognized and are of interest in their own right. Kamae, et. al. [8, Theorem 1] presents a general characterization of the partial ordering of probability measures induced by a closed partial ordering on the underlying Polish state space. A preorder is a reflexive and transitive, but not necessarily antisymetric relation, and so is a weaker kind of relation than a partial order. The intention here is to present a similar characterization of the preordering of probability measures induced by a measurable preordering on the underlying Polish space. Marshall and Olkin [9, Ch. 17.C] already hint that Kamae, et. al. [8, Proposition 1] is “. . . a much more general version involving general preorders.” Specifically, we exploit the properties of the preorder and theorems of Strassen [11] and Himmelberg and Van Vleck [6] to provide a Kamae, et. al. like characterization of the induced stochastic preorder. We then apply this result to obtain a characterization of stochastic majorization, Marshall and Olkin [9, Ch. 11], the preorder induced by the widely applied majorization preorder on Euclidean space. We show that a multifunction associated with this preorder is compact and convex valued and continuous, and hence satisfies the hypotheses of our characterization. The continuity properties of the majorization ordering and the induced stochastic majorization ordering have not been widely recognized and are of interest in their own right. A Borel space S is a Borel subset a Polish space, a complete separable metric space. A multifunction from S to a set T is a function with domain S with value F s a nonempty subset of T , for each sS . It is Borel measurable if 1 : F B s F s B S is a Borel set in S for each closed subset B of T . Let R be a Polish space. Throughout R is assumed to be endowed with a preorder, denoted by , a reflexive and transitive relation on R . For each xR , let : x y y x R . By reflexivity, x x , and by transitivity, y x implies that y x . Thus is a multifunction from R to R . We say the preorder is a measurable preorder if the multifunction is Borel measurable. We assume that is a compact valued and Borel measurable. For compact valued multifunctions, Borel measurability has many equivalent definitions. See [7, Theorem 3]. Let , P P R R R , denote the set of probability measures on R , respectively on 2 R R R , endowed with the topology of weak convergence. In this case, , P P R R R are both Polish spaces (in the Prohorov metric, [2, Theorem 6.8]). For P R denote by supp the support of , the smallest closed subset of R with measure one, [3, p. 18]. For each xR , let : supp x P x R and let x P R be the probability measure such that supp x x . Theorem 1. For each xR , x x and y x implies that y x . is a compact and convex valued Borel measurable multifunction from R to P R . Proof. The first statement follows from the same properties of . Thus is a multifunction. For each xR , x is convex, closed and tight. By Prohorov’s theorem, [2, Theorem 5.1], x is compact. That is Borel measurable follows from Himmelberg and Van Vlect [6, Theorem 3 (ii)]. Let F denote the collection of real valued Borel measurable functions on R that are increasing (nondecreasing) in the preorder on R , i. e., for : f R R , R the real line, f Borel measurable, f F if and only if for , x yR and x y , f x f y . We extend the preorder to P R as follows. For , P R , say that is larger in this extended preorder than , and write , if and only if fd fd for every f F for which both these integrals exist. Then our extension of to P R is truly an extension since for , x yR , x y in R if and only if x y in P R . Intuitively, if puts more weight on elements that are less extreme in the relation on R than does . This intuition is formalized by the characterization of on P R given below (Theorem 2). This definition of on P R is typical in the literature on stochastic orderings ([8] and [9, 17.A.3], but there are many others. See [10, Chs. 1, 4]. There are many definitions given in terms of R valued random quantities say X and Y . For example, one version of stochastic majorization of interest here is the relation 1 E in [9, Ch. 11]. This relation, denoted 1 E , is stated as 1 E X Y , Y stochastically majorizes X in the sense of 1 E , if E f X E f Y for all f F for which these expectations exist. In [9, Ch. 11], F is the cone of Borel measurable Schur convex functions. It is easy to see that this is equivalent to the above definition since these expectations are given by integration with respect to the distributions in R of these random quantities and given these distributions there are R valued random quantities with these distributions. There is a another definition of stochastic majorization that Marshall and Olkin [9, pp. 282-283] call 1 P that implies 1 E and appears ostensibly to be stronger than 1 E . There 1 P X Y if st f X f Y for all f F , where st is the typical meaning of stochastically larger, [10, 1.A]. Clearly 1 P implies 1 E since stochastically larger random variables have larger expectations. It turns out that in this particular case we also have 1 E implies 1 P . See the argument in [9, top of p. 283]. We will use this argument to show one part of the characterization of the relation in P R defined above. Here the orders 1 E and 1 P are the orderings as defined above, but in the abstract setting of this paper. Let B denote the Borel subsets of R . A Markov kernel on R is a map : [0,1] m R B such that for each set BB the map , x m x B is Borel measurable and for xR fixed , m x m x P R . For such a Markov kernel m and a probability measure P R denote by m the element of 2 P R defined by , A m A B m x B dx , for measurable rectangles, , A BB . We say that the first marginal of m is and denote the second marginal m . Finally, we say that a set B R is increasing if its indicator function belongs to F (necessarily, then BB ). These designations are borrowed from [8, pp. 899-900]. The following is the desired characterization of on P R and flushes out the intuition given above. Theorem 2. For , P R the following are equivalent: (i) ; (ii) There exists a Markov kernel m on R such that m and m x x , almost every xR ; (iii) There exists a probability measure 2 P R with K 1 with first marginal and second marginal ; (iv) There exists a real valued random variable Z and two measurable functions , : f g R R with f g (i. e., , f t g t t R ) such that the distribution of f Z is and the distribution of g Z is ; (v) There exist R valued random variables Y and X such that 1 P X Y and the distribution of X is and the distribution of Y is ; (vi) B B for every increasing set BB . Proof: The key equivalence is (i) and (ii). The rest follow easily. Let , P R and assume that (i) holds. For every bounded continuous function : z R R define , sup : h x z zd x . By Theorem 1 and [7, Theorem 2], , h z is Borel measurable in x , and for each xR , , sup z x h x z z since x x and x zd z x . Thus , h z is bounded as well, so all integrals below exist. Finally, , h z F , for if , x yR and x y then by Theorem 1, x y , and hence , , h x z h y z . It follows that , , zd h x z dx h x z dx , the last inequality from (i). Condition (ii) then follows from [11, Theorem 3]. Assume (ii) and let m . Since is Borel measurable, its graph K is a Borel subset of 2 R , [7, Theorem 3]. Then , 1 x m x dx , since the x -section x x for every x . This gives (iii). Therefore assume (iii). The construction in [8, Theorem 1(iii)] goes through here as well and this gives (iv). Assuming (iv) let X f Z and Y g Z . Then clearly 1 E X Y and (v) follows from the fact that 1 1 E P X Y X Y , since F satisfies [9, (3) p. 283]. Therefore assume (v). If BB and the indicator B I F , then B B B E I X E I Y B , which is (vi), where the inequality follows from the fact that st B B I X I Y . It remains to show that (vi) imples (i). Assume (vi). For f F , : x f x t I R F for all real t . It follows from (vi) that : : x f x t x f x t R R . Since f F , 0 0 : : f d x f x t dt x f x t dt f d R R (the equalities here hold by [1, (4), p. 223]). Also f F implies that f F , and we get that : : x f x t x f x t R R . As a result, 0 0 : : f d x f x t dt x f x t dt f d R R (the equalities here again by [1, (4), p. 223]). If both the integrals , fd fd exist, it follows that fd fd , which is (i) . The crucial implication (i) implies (ii) in Theorem 2 relies on Theorem 3 of Strassen [11]. In obtaining essentially the same implication Kamae, et. al. [8, Theorem 1, (i) implies (ii)] and [9, 17.B.1] use Theorem 11 of Strassen [11], which requires that the graph K of the multifunction be closed. Our assumption of measurability of yields the weaker condition that K is a measurable set. Of course if K is closed, as it is in the majorization application to follow, then for the measure in Theorem 2(iii), supp K . All that is required to use Theorem 3 of [11] is that the preorder be sufficiently regular to give Borel measurability in x of the function , h x z , defined above in the first paragraph of the proof of Theorem 2. Some measurability of the multifunction seems essential to this result, but weaker conditions than compact valuedness of may give the result. See for example [5, Proposition 3, p.60]. Theorem 2 of [7], however, is very handy. Transitivity of the preorder gives monotonicity of , h z in the preorder. Reflexivity of the preorder gives x x and x is convex whether x is or not. This convexity is essential to apply [11, Theorem 3], but it comes at no cost. The result Theorem 2(ii) formalizes our intuition expressed above that if puts more weight on elements that are less extreme in the relation on R than does . The Markov kernel m of Theorem 2(ii) is such that for almost every x , , 1 m x x . In this sense m shifts weight of to elements less extreme in the relation on R . Borrowing from the language suggested in [8, top of p. 900], the kernel m might be termed “downward.” Let us now consider the application to majorization. Let 1, , n x x x and 1, , n y y y be n-tuples of real numbers and let 1 , , n x x x and 1 , , n y y y denote the vectors x and y with coordinates rearranged in decreasing order, i. e., 1 n x x and 1 n y y . The vector x is majorized by the vector y (or y majorizes x ), written x y , if for each 1, , k n , 1 1 k k i i i i x y with equality holding for k n , [9, A.1, p. 7]. In words, x is majorized by y if the components of x are more evenly spread out than the components of y or the components of y are more concentrated than the components of x . This intuition is reinforced by noting the following. Let 1, ,1 e , the n-tuple whose coordinates are all equal to one. Then for a vector x the inner product x e is the sum of the components of x . Let , , x x e n x e n and let (0, , ,0, ,0) k x x e , where x e appears in the th k component. The vectors x , x , and k x all have the same total sum of components, but the components of x are more evenly spread out than those of x . Clearly k x concentrates this sum in one component. In this sense, x is the most evenly spread of this sum of components and k x is the most concentrated of this sum. Indeed, we have that k x x x , 1, , k n . We note that the majorization relation is reflexive and transitive (established below in Lemma 5) and hence is a preordering. It is not a partial ordering, however, since it is not antisymmetric. Indeed it is symmetric in that ' x x , where ' x is any permutation of x . Let n R denote n-dimensional Euclidean space. All topological properties in the sequel will be with respect to the usual metric on n R . Let Π denote the set of n n permutation matrices and let D denote the set of n n doubly stochastic matrices. Then M Π if and only if there is one one in each row and each column of M and all other entries are zero. Similarly, M D if and only if the entries in M are nonnegative and each row and each column sum to one. Theorem 3. For , n x y R , the following are equivalent: (i) x y ; (ii) x yD , some DD ; (iii) i i i x y , for some 0 i , 1 i i , and some i Π . Proof: The equivalence of (i) and (ii) is due to Hardy, Littlewood and Polya. See [9, Theorem 2.B.2]. The equivalence of (ii) and (iii) is due to Birkhoff. See [9, Theorem 2.A.2]. For each n x R let : n x y R y x , the set of n-tuples that are majorized by x . For a picture of this set in the case n = 3 see [9, Figure 3, p. 9]. Let , : n n y x R R y x , the graph of the multifunction . In the following, the notions of upper and lower semicontinuity in [6] are the same as the notions of upper and lower hemi-continuity in [5], which gives convenient characterizations of these notions in terms of sequences in n R . We will use the terminology of [6], but refer to these convenient characterizations in [5]. A multifunction is continuous if it is both upper and lower semicontinuous. The following are properties of . Theorem 4. is a compact convex valued continuous multifunction in n R . Consequently is closed in 2n R . Proof: Clearly for each n x R , x x , so is a multifunction. x is convex, by Theorem 3(ii) (convex combinations of doubly stochastic matrices are doubly stochastic) and compact since, by Theorem 3(iii), it is the convex polyhedron generated by the finite number of permutations of x . Suppose n x R and n k x R with limk k x x . If y x , by Theorem 3(ii), y xD , some DD . Then again by Theorem 3(ii), k k k y x D x and limk k y y , so is lower semicontinuous, [5, Theorem 2, p. 27]. Let k k k k y x D x with k D D arbitrary. By Birkhoff’s theorem D is the convex polyhedron generated by the permutation matrices and hence is compact in 2 n R . Thus there is a subsequence of the k D that converges to an element DD . For this subsequence indexed by ' k , ' ' limk k y xD x . Thus is upper semicontinuous, [5, Theorem 1, p. 24]. The closure of then follows by the same result . As obvious as these properties of are, except for convexity of x , they appear nowhere prominently in the literature on majorization to this author’s knowledge. Eaton and Perlman [4] do exploit the upper semicontinuity of in the proof of their Lemma 4.1. The following result establishes the transitivity of majorization and is needed later for the characterization of stochastic majorization. Lemma 5. For , n x y R , if y x , then y x . Proof: For , n x y R , suppose y x and z y . Then by Theorem 3(ii) ˆ z yD and y xD some ˆ , D DD . But then ˆ z xDD and ˆ DDD , [9, 2.A.3, p. 20]. Again by Theorem 3(ii), z x . As in the abstract case, for each n x R , let ( ) : supp n x P R x . Let , ( ) : n n x R P R x , the graph of the multifunction . Theorem 6. is a compact and convex valued continuous multifunction from n R to ( ) n P R . The graph is closed in ( ) n n R P R . Proof: For n x R , x x . This result is then Theorem 1 above, but specialized to the example here. By [6, Theorem 3(i)], inherits the continuity of established in Theorem 4. The closure of follows from [5, Theorem 1, p. 24]. The functions : n f R R that are increasing (non-decreasing) in the majorization relation are called Schur-convex. See [9, Ch. 1.D, Ch. 3] for the origins of this terminology and the characterizations of this class of functions. Denote by SC the class of Borel measurable Schur-convex functions. The measurability requirement is a restriction, [9, 3.C.4, p. 70]. We can extend the relation in n R to a relation in ( ) n P R as we did above by taking SC F . This relation in ( ) n P R is the version of stochastic majorization 1 E studied in [9, Ch. 11]. Here the relations 1 E and 1 P are the relations in [9, Ch. 11]. Let n B denote the Borel sets in n R and call n B B Schur convex if B I SC . The following characterization of in ( ) n P R results. Theorem 7. For , n P R the following are equivalent: (i) ; (ii) There exists a Markov kernel m on n R such that m and m x x , almost every n x R ; (iii) There exists a probability measure 2n P R with supp K with first marginal and second marginal ; (iv) There exists a real valued random variable Z and two measurable functions , : n f g R R with f g (i. e., , f t g t t R ) such that the distribution of f Z is and the distribution of g Z is ; (v) There exist n R valued random variables Y and X such that 1 P X Y and the distribution of X is and the distribution of Y is ; (vi) B B for every Schur convex set n B B . Proof: This result follows from Theorem 2 by noting that by Theorem 4, iscompact valued and the graph K is closed and hence is a Borel set, which by [7,Theorem 3] implies that the multifunction is Borel measurable. References1. P. Billingsley, “Convergence of Probability Measures,” Wiley, New York, 1968.2. P. Billingsley, “Convergence of Probability Measures,”2, Wiley, New York, 1999.3. J. L. Doob, “Measure Theory,” Springer-Verlag, New York, 1994.4. M. L. Eaton and M. D. Perlman, Reflection groups, generalized Schur functions, andthe geometry of majorization, Annals of Probability, 5 (1977), 829-860.5. W. Hildenbrand, “Core and Equilibria of a Large Economy,” Princeton UniversityPress, Princeton, 1974.6. C. J. Himmelberg and F. S. Van Vleck, Multifunctions with values in a space ofprobability measures, Journal of Mathematical Analysis and Applications, 50 (1975),108-112.7. C. J. Himmelberg, T. Parthasarathy, and F. S. Van Vleck, Optimal plans for dynamicprogramming problems, Mathematics of Operations Research, 1 (1976), 390-394.8. T. Kamae, U. Krengel, and G. L. O’Brien, Stochastic inequalities on partially orderedspaces, Annals of Probability, 5 (1977), 899-912.9. A. W. Marshall and I. Olkin, “Inequalities: Theory of Majorization and ItsApplications,” Academic Press, New York, 1979.10. M. Shaked and J. G. Shanthikumar, “Stochastic Orders and Their Applications,”Academic Press, New York, 1994.11. V. Strassen, The existence of probability measures with given marginals, Annals ofMathematical Statistics, 36 (1965), 423-439.
منابع مشابه
Stochastic averaging for SDEs with Hopf Drift and polynomial diffusion coefficients
It is known that a stochastic differential equation (SDE) induces two probabilistic objects, namely a difusion process and a stochastic flow. While the diffusion process is determined by the innitesimal mean and variance given by the coefficients of the SDE, this is not the case for the stochastic flow induced by the SDE. In order to characterize the stochastic flow uniquely the innitesimal cov...
متن کاملTOPOLOGICAL SIMILARITY OF L-RELATIONS
$L$-fuzzy rough sets are extensions of the classical rough sets by relaxing theequivalence relations to $L$-relations. The topological structures induced by$L$-fuzzy rough sets have opened up the way for applications of topological factsand methods in granular computing. In this paper, we firstly prove thateach arbitrary $L$-relation can generate an Alexandrov $L$-topology.Based on this fact, w...
متن کاملSimulation for Continuous-Time Markov Chains
This paper presents a simulation preorder for continuoustime Markov chains (CTMCs). The simulation preorder is a conservative extension of a weak variant of probabilistic simulation on fully probabilistic systems, i.e., discrete-time Markov chains. The main result of the paper is that the simulation preorder preserves safety and liveness properties expressed in continuous stochastic logic (CSL)...
متن کاملEquilibrium Points of Logarithmic Potentials Induced by Positive Charge Distributions. I. Generalized De Bruijn-springer Relations
A notion of weighted multivariate majorization is defined as a preorder on sequences of vectors in Euclidean space induced by the Choquet ordering for atomic probability measures. We characterize this preorder both in terms of stochastic matrices and convex functions and use it to describe the distribution of equilibrium points of logarithmic potentials generated by discrete planar charge confi...
متن کاملPreorder Relators and Generalized Topologies
In this paper we investigate generalized topologies generated by a subbase of preorder relators and consider its application in the concept of the complement. We introduce the notion of principal generalized topologies obtained from the new type of open sets and study some of their important properties.
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2005